CS 6332: Fall 2008 Systems for Large Data Review
Abstract
MapReduce [10] gives us an appropriate model for distributed parallel computing. Several of its features have proved useful: 1) centralized job distribution; 2) fault-tolerance mechanisms for both masters and workers. Although there is controversy over whether MapReduce can replace a standard RDBMS [12, 13], it is reasonable to say that existing proposals to use MapReduce for relational data processing [39] do not handle complicated queries very well. Besides, MapReduce itself is not very user-friendly for most programmers and may therefore need an additional special-purpose language or system on top of it to make it easy to use [30].
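To make the programming model mentioned above concrete, here is a minimal single-process sketch of the map, shuffle, and reduce phases, using word counting as the example. It is only an illustration under stated assumptions: the names `map_fn`, `reduce_fn`, and `run_mapreduce` are hypothetical, nothing here is distributed or fault tolerant, and it is not the API of any real MapReduce system.

```python
from collections import defaultdict


def map_fn(doc_id, text):
    # Map phase (hypothetical helper): emit (word, 1) for every word.
    for word in text.lower().split():
        yield word, 1


def reduce_fn(word, counts):
    # Reduce phase (hypothetical helper): sum the partial counts of one word.
    return word, sum(counts)


def run_mapreduce(inputs, map_fn, reduce_fn):
    # Shuffle phase: group all intermediate values by their key.
    # A real system would do this across machines; here it is one dict.
    groups = defaultdict(list)
    for key, value in inputs.items():
        for out_key, out_value in map_fn(key, value):
            groups[out_key].append(out_value)
    # Apply the reducer once per distinct key.
    return dict(reduce_fn(k, vs) for k, vs in sorted(groups.items()))


docs = {"d1": "map reduce map", "d2": "reduce data"}
result = run_mapreduce(docs, map_fn, reduce_fn)
print(result)
```

Even in this toy form, the user has to express the whole computation as two key-value functions, which hints at why a join or a nested relational query is awkward to write directly in this style.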
Similar resources
System Monitor for Network of Workstations
* E-mail: {isaacc, ashsu}@cs.berkeley.edu This work is done as a term project for CS 262, taught by Professor Eric Brewer, Fall 1996. An experimental information management system is developed to monitor system data for a network of workstations, a heterogeneous computing environment consisting of more than a hundred machines. The information system is designed to help system administration and...
Searches for SUSY with the ATLAS detector
We present a review of the SUSY search strategies in ATLAS in conjunction with a readiness of the detector systems for first collision data in fall 2009. Commissioning was performed with the LHC single beams and the cosmic ray data in 2008. The talk covers the analysis strategies based on the large E_T^miss plus high p_T multi-jets signature in which the number of methods are investigated to extr...